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Abstract 

We live in a world increasingly dominated by networks - com- 
munications, social, information, biological etc. A central at- 
tribute of many of these networks is that they are dynamic, 
that is, they exhibit structural changes over time. While the 
practice of dynamic networks has proliferated, we lag behind 
in the fundamental, mathematical understanding of network 
dynamism. Existing research on time-varying graphs ranges 
from preliminary algorithmic studies (e.g., Ferreira's work 
on evolving graphs) to analysis of specific properties such as 
flooding time in dynamic random graphs. A popular model 
for studying dynamic graphs is a sequence of graphs arranged 
by increasing snapshots of time. In this paper, we study the 
fundamental property of reachability in a time-varying graph 
over time and characterize the latency with respect to two met- 
rics, namely store-or-advance latency and cut-through latency. 
Instead of expected value analysis, we concentrate on char- 
acterizing the exact probability distribution of routing latency 
along a randomly intermittent path in two popular dynamic 
random graph models. Using this analysis, we characterize 
the loss of accuracy (in a probabilistic setting) between mul- 
tiple temporal graph models, ranging from one that preserves 
all the temporal ordering information for the purpose of com- 
puting temporal graph properties to one that collapses various 
snapshots into one graph (an operation called smashing), with 
multiple intermediate variants. We also show how some other 
traditional graph theoretic properties can be extended to the 
temporal domain. Finally, we propose algorithms for control- 
ling the progress of a packet in single-copy adaptive routing 
schemes in various dynamic random graphs. 

'Research was sponsored by the Army Research Laboratory and was ac- 
complished under Cooperative Agreement Number W91 1NF-09-2-0053. The 
views and conclusions contained in this document are those of the authors and 
should not be interpreted as representing the official policies, either expressed 
or implied, of the Army Research Laboratory or the U.S. Government. The 
U.S. Government is authorized to reproduce and distribute reprints for Gov- 
ernment purposes notwithstanding any copyright notation here on. 



1 Introduction 

We live in a world increasingly dominated by networks - com- 
munications, social, biological etc - imagine, for instance, 
an ad hoc infrastructureless communications network of con- 
stantly mobile soldiers. A central feature of many of these net- 
works is that they are dynamic, that is, they exhibit structural 
changes over time. While the practice of dynamic networks 
has proliferated, especially in the area of military communica- 
tions networks, we lag behind in the fundamental, mathemati- 
cal understanding of network dynamism. 

Time-varying graphs have been a topic of active research 
recently [5,9, 12, 19]. They are useful in the study of commu- 
nication networks with intermittent connectivity such as delay- 
tolerant networks 1 15 1 and even disruption-tolerant social net- 
works 1 14 1; duty cycling wireless sensor networks J3][4j[7), 
and the like. Existing research on time-varying graphs ranges 
from algorithmic studies on graph journeys fl2) to analysis of 
specific properties such as flooding time in dynamic random 
graphs (5]l9). Empirical simulation-based analysis of certain 
temporal graph properties such as temporal distance and tem- 
poral efficiency has also been a topic of recent research pT) . 

In this paper, we propose a model of time-varying graphs 
called Temporal Graphlets which are essentially a time-series 
of static graph snapshots. While similar models have been 
studied in the literature before, albeit with alternative names 
such as space-time graphs fl8) , we propose new research di- 
rections in temporal graph theory and present analytical results 
on two different aspects of this temporal graph model. 

First, a directed stacked graph is created from all the tem- 
poral snapshots of the time-varying graph and we show how 
certain standard graph theoretic properties such as reachabil- 
ity, connectivity, etc. can be extended to this model. Then we 
propose a technique named smashing for collapsing all or parts 
of the temporal graph and analyze how the reachability prop- 
erty is affected due to the loss of temporal ordering informa- 
tion. We also introduce an intermediate model of m-smashed 
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graphs which selectively collapse parts of the temporal graph 
while preserving the remaining stacked structure. We show 
how the degree of smashing can impact graph properties by 
means of a thorough comparative probabilistic analysis of the 
reachability property for the simple time-varying line network. 
This is potentially useful for online analysis of large temporal 
graphs where accuracy can be traded for speed and complexity. 

We study two different metrics for measuring latency in this 
paper: (a) Store-or-advance; and (b) Cut-through. In the for- 
mer, a message can be forwarded to only a neighbor in a unit 
time step, whereas in the latter, a message can be routed to 
any neighbor in the currently connected component instanta- 
neously. In this paper, we study theoretical aspects of reacha- 
bility in temporal graphs under various random edge-dynamics 
models. In particular, we characterize the exact probability 
distributions for latency (not just the first moment) and also 
a recursive form for message location in two popular dynamic 
random graph models for the dynamic line graph (or linear net- 
work topology), namely, the independent probabilistic model 
and the two-step Markov chain model. 

Finally, we propose an adaptive routing algorithm that min- 
imizes expected traversal time between a source and a desti- 
nation node in the independent probabilistic temporal graph 
model. 

This paper is organized as follows. Section [2] introduces 
deterministic and random models of temporal graphs. Sec- 
tion [3] presents results on the probabilistic analysis of latency 
along dynamically changing random paths in graphs. Section 
[4] presents stacked and smashed graph models for temporal 
graphs and presents comparative probabilistic analysis of la- 
tency under both models for time-varying random paths. Sec- 
tion [5] presents an adaptive routing algorithm in time-varying 
graphs. Section [6] concludes the paper with a discussion on 
future research directions. 

2 Models of Temporal Graphs 

Time-varying graphs occur commonly in the real world, and it 
is necessary to have mathematical models for their represen- 
tation. We first introduce a deterministic model for represent- 
ing a series of time-varying graphs, and propose two different 
models for routing in such graphs. We then propose enhance- 
ments to well known dynamic random graph models, which 
are used throughout this paper for analysis. 

2.1 Temporal Graphlets: A Deterministic 
Model of Dynamic Graphs 

Assume slotted time starting at time 0. Slot t starts just after 
time t — 1 and ends at time t. A Temporal Graphlet Sequence 
TGS(T U T 2 ) = {G(t) = (V (t), £(*))}, T x < t < T 2 is 
our basic deterministic model for a dynamic network and at- 
tempts to capture its space-time trajectory (see Figure[T|i. Each 



G(t) is referred to as a Temporal Graphlet or simply Graphlet. 
Alternate notations that we will use, depending on the empha- 
sis, include G(T\, T 2 ), G[l, T] (shifting the frame of reference 
maintains properties), G[T] (reference shifting is implied). 

While traditional graph theory only considers properties 
in the "horizontal" (space) dimension, we consider proper- 
ties across the "vertical" (time) dimension as well. For in- 
stance, u — y v is T-reachable iff there exists a sequence of 
edges (m,u 2 ), (u 2 ,u 3 ), ...(u m -i,u m ), u = Ui, v = u m and 
(ui,Ui+i) <E V(tj), 1 < i < m, tj > tj-i, 1 < t j < T. 

For example, in Figure [T] every graphlet is disconnected, 
but T-reachability holds for a — > f. Similarly, a T-cut is the 
removal of a set of vertices X c 7(1)U^(2)UV(3) . . .UV{t) 
that results in some u and v losing their T-reachability prop- 
erty. Special or restricted temporal graphlets are also possible, 
e.g., a T-fc-regular graph is one in which every node makes 
unique contact exactly k times during its lifetime. 

Assume a node v wants to send a message to a certain node 
u. At the beginning of a slot the node that has the message 
can store it or forward it to another neighboring node. At the 
end of the slot the graph may change according to the TGS. 
There are two models for measuring progress accomplished 
by a message under the circumstances. 

Definition 2.1. In the Store or Advance (SoAj model, a node 
can forward the message only to one of its direct neighbors, 
and that is assumed to take a time slot. Even if the neighbor's 
neighboring edges are active right now, one may not be able 
to avail those edges right away. Instead, one has to wait for 
at least one (generally more) time slot(s) until the message 
reaches the neighbor. 

Definition 2.2. In the Cut-through (CuTJ model, a node may 
send the message to any node in its connected component, 
and the entire connected component can be traversed instan- 
taneously or at least in a much shorter time scale than that of 
edge dynamics. 

While the SoA model finds more applications in most time- 
varying networks such as MANETs, DTNs, and social net- 
works [14 1, the CuT model is interesting in its own right, 
and has been proposed in certain applications in low latency 
MANET design (10). 

In Section |4] we show how the deterministic temporal 
graphlet model can be useful for extending static graph theo- 
retic properties to dynamic graphs. A related concept of slices 
has been proposed recently fT9) . They define coupling vari- 
ables between instances of the same node in consecutive slices. 
However, the focus of this work is on detecting communities 
over time. 

2.2 Stochastic Models of Dynamic Graphs 

Random graph models are very useful for studying a plethora 
of graph properties in a probabilistic sense. A classic exam- 
ple of random graphs is the family of Erdos-Renyi graphs 
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Figure 1: Temporal Graphlets for t = 1,2,3. V(l) = V(2) = 
V(3) = {a,b,c,d,e,f}. Although this figure does not illustrate 
it, vertex set need not be the same (nodes could be added or 
deleted) 

ER(n,p) which are static graphs on n nodes with any of the 
(™) edges existing with probability p. The probability of the 
existence of an edge is independent of that of another edge in 
the graph. Although too simplistic and perhaps unrealistic for 
many application scenarios, random graphs have played a big 
role in the development of a good understanding of key physi- 
cal phenomena such as phase transitions and percolation p3) . 

Researchers have proposed adding a time dimension to the 
static random graph model such that time is slotted and each 
edge in the graph exists in each time slot with probability p 
and does not exist with probability 1 — p flOj. We refer to this 
graph as the dynamic Erdos-Renyi graph. 

Definition 2.3. Dynamic ER{p) graphs: Gt which is the 
graph at the end of slot t and at the beginning of slot t + 1 
is drawn from the family of graphs ER(n,p). Go is the initial 
graph and Gt is the final graph if the time horizon ends at 
time T. 

Definition 2.4. Markovian (q,p) graphs: In this model of 
dynamic random graphs [9], each edge in Gt can be in 
one of two states, ON or OFF, and the probability distri- 
bution is governed by a two-state Markov chain. The tran- 
sition probabilities are given by P{OFF — » ON) = p, 
P(OFF -> OFF) = 1 - p, P(ON -> OFF) = q, and 
P(ON -> ON) = l-q. 

We propose a generic enhancement to these two dynamic 
random graph models. Instead of allowing a stochastic process 
to act on all of the possible (™) edges, we restrict it to act 
on only the edges in a given underlying graph, G u . Clearly, 
when G u = K n , the complete graph, these stochastic process 
applies to all possible edges, and then this is equivalent to the 
older model. 

Observation 2.1. The Markov (1,1) dynamic graph cor- 
responds to the family of perfectly alternating graphs, 



{Gt, Gt+i), such that Gt+i has all the edges that do not exist 
in G t , and vice versa. 

Observation 2.2. At any time slot, ifG u — K n , the (1 —p,p) 
Markov graph is equivalent to the dynamic ER{p) graph. 

Observation 2.3. Another special case is the (p,p)-stochastic 
model. Here, define p to be the stability factor. For small p, 
there are few changes from Gt to Gt+i and the graph is stable. 
For large p, there could be many changes from G t to G t +\ 
and the graph is unstable. A special case of this special case 
is the (1, l)-stochastic model in which edges and non-edges 
alternate at each time slot. 

3 Analyzing Latency along Dynamic 
Paths 

Many routing schemes determine a path (say, according to 
a shortest path calculation), and then stay on that path even 
though it may be intermittently connected due to edges on it 
appearing and disappearing according to one of the aforemen- 
tioned stochastic processes. 

Hence we consider the simplest case which is amenable 
to mathematical analysis - the underlying graph G u = L n , 
the line graph with n vertices and n — 1 edges in which ver- 
tex 1 wants to send a message to vertex n. We denote these 
graphs by ER(n,p, L n ) and MC(q,p, L n ). Clearly a mes- 
sage should either be stored or be either advanced as much as 
possible (under the CuT model) or one hop per time slot (un- 
der the SoA model). 

We now study how random variables such as time taken to 
reach node n from node 1 behave as a function of n,p, q, G u . 
We first show how simple expected value analysis can yield 
first moments, and then characterize the entire probability dis- 
tributions as a function of such parameters. The results of 
this analysis will be applicable to the analysis of Temporal 
Graphlets in Section [4] 

3.1 The (1, l)-Stochastic Model 

For the (1, l)-stochastic model, one can compute the exact ar- 
rival time. Define a configuration as a binary string of length 
n — 1. If the i-th bit is 1 then the i-th edge on the line ex- 
ists otherwise it does not exist. For a given binary string B, 
let k(B) be the number of changes from to 1 or from 1 to 
and let b(B) be the value of the first bit of B. For example, 
fc(OOlllOOHOOl) = 5. 

Observation 3.1. The routing in the CuT model takes fc+1 — b 
slots. 

Observation 3.2. The routing in the SoA model takes 2n — 
k — b slots. 
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Corollary 3.1. The best configuration for CuT is 111 • • • I for 

which the routing takes slorj 

Corollary 3.2. The worst configuration for CuT is 0101 • • • 
for which the routing takes n — 1 slots. 

Corollary 3.3. The best configuration for SoA is 1010 • • • for 
which the routing takes n — 1 slots. 

Corollary 3.4. The worst configuration for SoA is 000 • • • 

for which the routing takes 2(n — 1) slots. 

We now compute the average routing time assuming a uni- 
form distribution for all the 2 I1_1 configurations. 

Observation 3.3. The average routing time for CuT is § (n — 
1) slots. 

Observation 3.4. The average routing time for So A is Mil — 

1) slots. 



3.2 The (1 - p, p) -Stochastic Model 

This is equivalent to the ER(n,p, L n ) model. We first begin 
with computation of expected values of advancement of a mes- 
sage until it hits a non-edge and the expected routing latency. 
Subsequently we derive the exact probability distributions of 
the spatio-temporal location of the message as well the dis- 
tribution of the routing latency under both the SoA and CuT 
models. 

Observation 3.5. In SoA the expected advance is p • 1 + (1 — 
p) ■ = p. 

Observation 3.6. In CuT the expected advance is upper- 
bounded by (1 — p) 'Y^Li i-P 1 = TZp* 

The following corollaries follows since the length of the 
route is n — 1. 

Corollary 3.5. In SoA the expected time for the routing time 
is — 




Figure 2: Probability distribution of the packet as function of 
space and time for ER(n — 10, p = 0.25, L n ) for t = 30 



Let Nt be a random variable denoting the node that the 
packet has reached at time t, and be the fc-th edge. 

P(N t = k)=P{N t -! = k — l)P(e fc _i) + P(N t -i = fc)P(gfe) 
=P(JV t _i =k — l)p + P(Nt-i = k)(l-p) (1) 



It is difficult to solve the above bivariate recurrence to attain 
a closed form for P(N t — k), hence we compute the probabil- 
ities numerically. Figure [2] shows an example of a probability 
distribution for a small line graph. The example considers a 
line graph on n — 10 nodes for p = 0.25. It is easy to see that 
since the expected waiting time for every hop is |, each hop 
takes approximately 4 time slots to traverse. Hence at t = 20, 
the packet would have traversed a mean of 5 hops, which is 
indicated in the figure. 

Let T be a random variable denoting the number of time 
slots needed for a packet to reach from node 1 to node n. It is 
easy to see that P(T < n — 1) = since it takes at least n — 1 
slots to reach node n. The general distribution of T is given 
by the following: 



Corollary 3.6. In CuT the expected time for the routing time 

(n-l)(l-p) 
P 



IS 



n + j-2 



P(T = n-l+j)= (" ' J ")(l-p) J *p n_1 ,Vj>0 (2) 



SoA latency Consider an Erdos-Renyi line graph on n nodes 
which denoted by ER f (n, p, L n ) at the t-th time instant. There 
are a maximum of n — 1 edges in this graph, and at each time 
instant, each edge exists with probability p. We want to send 
a packet from node 1 to node n; if an edge (u, v) is up at time 
instant t, and u has the packet, then it will transmit to v in that 
instant, otherwise, it will hold it until a later time instant when 
the edge becomes active. We want to track the probability 
distribution of the packet over time as a function of n and p. 



1 This assumes that cutting through the network takes negligible time com- 
pared to waiting. 



This is because there are exactly j time slots when the 
packet has to wait at one of the nodes 1,2,3, ... ,n — 1, and 
there are (™ + j 2 ) number of ways of assigning these j slots to 
the n — 1 nodes. Figure [3]plots this distribution. 

It can easily be verified that E[T] = Y™- n kP(T 



-, which is in agreement with Corollary 
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CuT latency We now characterize the distribution of routing 
times in terms of the cut-through metric. It is assumed that the 
time taken to cut through the edges in a connected component 
do not cost any time slots and time elapses only due to waiting 
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Figure 3: Probability distribution of time taken to traverse 
the dynamic line graph ER(10,p, Liq) for values of p = 
{0.1,0.2,0.25,0.5} 

for an inactive link to become active^ 

Let T be the random variable denoting the number of time 
slots taken to reach node n from node 1 if nodes were for- 
warding the packet as much as possible toward the destination 
in the current connected component. 

P(T = fc)=Pr{Wait for k slots at {1, 2, . . . , n - 1}} 

+ k-2 
k 



{l-pfp 



k^n— 1 



(3) 



This is because the number of ways of assigning k waiting 
slots at one or more of nodes {1, 2, . . . , n — 1} is the same 
as number of ways putting k balls in n — 1 distinct bins with 
no restrictions on the number of balls in a particular bin, and 
this is given by 2 ) ■ Note that the only reason the packet 

needs to wait for a slot at node j is if the edge + 1) is 
inactive at that time instant. This contributes to the p k term. 

It can be verified that E[T] = Y™_ n kP(T = k) = 



(n — which is consistent with Corollary 3.6 Also, the 

variance is given by: Var[T] = E[T 2 ]~E[T] 2 = (n-l)±=£. 
Not surprisingly the mean time elapsed when using the CuT 
metric is smaller than that in case of the So A metric. 

3.3 The (q, p) -Markov Model 

Now we study routing on dynamic line graphs 
MC(po,q,p, L n ), where po is the probability of an edge 
existing in the first graphlet. 

Observation 3.7. It is easy to see that this Markov chain has 
a stationary distribution Tr = (7r OIl , 7r //) = (j^; p+<j)' ^° 
eliminate the effect of transients, we assume that the Markov 
chain has converged (or mixed) before node 1 sends the mes- 



sage to node n; in other words, po = n 



v 

p+q' 



2 A useful metaphor would be that of light passing through an intermittently 
connected network. The time scales of disruption are much lower than those 
of light traversing a connected component. 
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Figure 4: Analyzing the CuT latency in a dynamic random 
line graph 

Observation 3.8. In CuT the expected advance on an infinite 
line is upper-bounded by |. 

Observation 3.9. In SoA the expected advance is 

Corollary 3.7. In CuT the expected time for the routing time 
is plpHy [Proof omitted] 

Corollary 3.8. In SoA the expected time for the routing time 
is n — 1 + p(p+q) ■ [Proof omitted] 



CuT latency Figure [4] illustrates a sample path from 1 — >• n 
over tim^J There are several such paths possible depending 
on the state of the edges, and the computation here is more 
involved than the ER(n,p, L n ) case. 

Any path through this space-time can be characterized by its 
constituent segments: {1, ki,ti, fe, t%, . . . , k m ,t m , n}, where 
t m = t. Clearly 1 < m < min(n — 1, t — 2). 

Let X^. correspond to a binary random variable that denotes 
the status of edge e at time instant r. The probability that path 
P = {1, k%,ti, &2, i2) • • • , k m , t m , n} exists is given by the 
following: 



Pr{P}=Pr{X\ ,Xl...,X k 1 i-\X k 1 \X k 2 \ 

vkm \rn— 1~1 

. . . , A; , . . . , A t } 

=p(xl)---p(x^- 1 )p(x kl ,x k 2 \. 

■ ■■Pix^Pixr 1 ) 



V" 1 vk\ 



(4) 

fel \ 



A tl-1' A tl 



(5) 



= ".".,', *7To//(l -pY 1 2 pX 



on 



3 We present CuT before SoA since the former is easier to explain, and we 
will reuse the analysis technique for the latter, later on. 
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P \ n— rn— 1 / 

> + p + <? 



(p + q)"- 1 



where p > 0, g > 



(6) 
(7) 

(8) 
(9) 



time 



Equation [5] follows from Eq. |4]by using the fact that prob- 
abilities of statuses of various edges are independent of each 
other. However, the probability of existence of an edge (say 
ki) at successive time instants are related by the Markov chain 
parameters, p and q. Therefore, we have: 



A ti-1) A ti 



T off 



(l-p)<- 



(10) 



For each segment corresponding to waiting, the probability 



of the existence of that segment is given by Equation 10 Using 
the fact that there exist m such "wait" segments and m "cut- 
through" segments, Eq. |7]can be simplified from Eq. 1 1 



Let the number of paths that have exactly m bends be N m . 
We observe that a path may be generated by independently 
choosing m bending points each on the space and time axes. 
The number of ways of doing so are and ('_,) respec- 

tively. Hence N rn = (*n-i)- Therefore, the latency 

probability distribution forp > 0, q > is given by: 



P(T = t 

n-l 



i) 



n — 1 
rn 



t- 
m 



1 g m (l-p)* 



(P + q) r 



(11) 
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Figure 5: Analyzing the So A latency in a dynamic random 
line graph 

shaded parallelogram; these segments eventually reach (n,t). 
The width of this parallelogram is t — n. 

We borrow the techniques used in the CuT probability com- 
putation previously and note that paths with m waiting points 
are possible inside this parallelogram with 1 < m < min(n — 
l,t — n). Using similar techniques as the CuT computation, 
we can compute the probability of a certain path P inside the 
parallelogram with m waiting points (or "bends") as follows: 



Pr{P}=< n 



'off 



{l-pf- 



n — m pin 



(12) 



If p + q = 1, the Markov chain reduces to the indepen- 
dent ER(n,p, L n ) scenario, and it can be verified that Equa- 
tion [TT| reduces to Equation [3] (with k substituted for t — 1). 
We have also numerically verified that for p — » 1, q 1, 
E[T] = \ (n — 1), in agreement with Observation '. 
the general case is in agreement with Corollary 3 



3.3 
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So A latency Figure [5] illustrates the latency under the So A 
model. Since each forwarding action to the neighbor costs a 
time slot, the latency t — 1 obeys 4—1 > n — 1, with the 
best case scenario being the diagonal green path from (1,1) 
to (n,t). Hence if we want to compute P(T = t — 1), we 
have to consider all paths that use the diagonal "forward" seg- 
ments and vertical "wait" segments, and are contained in the 



4 We note that this technique can be used in the probability computation 
for the case where each edge e has a different (q e , p e ) . The expression|7]will 
then exhibit a much more complicated product form. 



Let the number of paths that have exactly m waiting points 
(or "bends") be N m . Since a path may be generated by in- 
dependently choosing m bending points each on the diagonal 
and vertical axes of the parallelogram, N m = ("Z ) 
Therefore, the latency probability distribution forp > 0, q > 
is given by: 

P(T = 4-1) 



m'm(n—l.t — n) 



E 



n-l\ft-n - \\p n - 1 q m {l~p) t - n - m 
m J\ m— 1 / (p + '?) ,l ~ 1 

(13) 



where t > n. If p + q = 1, the Markov chain reduces to the 
independent ER(n,p, L n ) scenario, and it can be verified that 
Equation [l3]reduces to Equation [2] We have also numerically 
verified that forp — > 1, q — > 1, E[T] = | (n— 1), in agreement 
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with Observati on|3.4| and that the general case is in agreement 
with Corollary |3.8| 

4 Stacked and Smashed Representa- 
tions of Temporal Graphlets 

Since there is a solid theory of traditional non-temporal graphs, 
an obvious question to ask is if the study of some temporal 
properties may be reduced to studying the same property on an 
equivalent single non-temporal graph. We consider two such 
representations - the stacked graph (StG) and the smashed 
graph (SmG). A stacked graph is constructed by drawing di- 
rected edges in the direction of time between successive tem- 
poral graphlets in a TGS; a smashed graph is a "collapsed" 
version of the stacked graph. Alternatively, it is union of the 
TGs. Clearly, an SmG is a "lossy" version. However, it is far 
more succinct, and therefore it would be interesting to know 
when, if at all, it will suffice. 

The study of such "reducibility" is helpful in that it will 
allow us to use well-known graph-theoretic algorithms (and 
code) on the appropriate representation to easily evaluate 
whether properties such as reachability, connectivity etc. hold. 

We note that the "evolving graph" representation proposed 
in fl2) which labels edges with the times at which they are 
active is equivalent to the stacked graprj^] but an evolving 
graph not a traditional graph. Hence reducing to an evolv- 
ing graph does not allow us to easily leverage existing algo- 
rithms or code. It is imaginable that a smashed graph (or its 
m-smashed variant defined later) can be used to quickly an- 
swer on-line queries for graph properties in massive tempo- 
ral graphs even though such queries may only be answered 
approximately. Therefore, it is interesting and worthwhile to 
compare the complexity vs. accuracy tradeoffs of smashing 
for various temporal graphs. 

4.1 Definitions and Basic Properties 

We begin with some definitions. 

Definition 4.1. Given a temporal graphlet sequence G[l, T], 
the stacked graph (StG) o/G[l, T] is StG = (V s , E s ), where 
Vs = U t V(t), Es = U t E(t) U Eq where E c is a set of "cross 
edges" connecting vertices of adjacent (in time) graphlets. 
That is, E c = U M (wj(i), m(t + 1)). 

Definition 4.2. Given a temporal graphlet sequence G[l, T], 
the smashed graph (SmG) of G[1,T\ is SmG = (V M ,E M ), 
where each sequence of u(i),u(t + 1), . . . is replaced by a 
single vertex u G Vm, and Em = U t E(t) with endpoints of 
edges mapped to the replaced vertices in Vm- 

Definition 4.3. Given a temporal graphlet sequence G[l, T], 
the m-smashed graph (m-SmG) of G[l, T] is m-SmG = 
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And deleting the labels yields a smashed graph. 



Figure 6: Various representations of temporal graphlets for 
the TGS in Fig. [T] 

(Vm, Em), where the smashing operation is not performed 
on the entire G[1,T] but on each of G[l,ra],G[m + 
1, 2m], G[2m + 1, 3m], . . . instead. 

The various aforementioned representations of the temporal 
graphlet sequence shown in Figure [T] are illustrated in Figure 
[6] As mentioned earlier, the StG and Ferreira's evolving graph 
model are equivalent in terms of information content. On the 
contrary SmG is lossy since temporal ordering information is 
lost during smashing of graphlets. This can result in some 
false positives (e.g., in the smashed graph, e — > b is a valid 
spatio-temporal path, whereas that is not the case in reality). 

The technique of m-smashing tries to balance the tradeoffs 
between StG (or evolving graphs) and SmG by restricting the 
smashing to a smaller number of graphlets at a time. For ex- 
ample, in Figure [6] the first two graphlets are smashed into 
one, and the result is stacked with the third graphlet. Note that 
some false positives that were deduced from the SmG (e.g., 
e — > b and e — > d) disappear in m-SmG. However, some other 
false positives such as c — > b still remain. 

We note that StG and SmG are non-temporal, or traditional 
graphs. Consider a property P (definitions of some basic prop- 
erties studied in this paper are in Table [TJ. Can the question 
of whether P is true in G[l, T] be answered by evaluating P 
on StGl If we can, we call such a property stacked-graph 
reducible (StG- reducible). Similarly, if it can be answered by 
evaluating P on SmG then we call it smashed-graph reducible 
(SmG- reducible). 

Definition 4.4. Let P(H) be a function denoting the value 
(including true/false) of a property P on a structure H where 
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Figure 7: Probability distribution of reachability on a 
ER(n,p,L n ) TGS. SmG(all) corresponds to SmG and 
SmG(m) correspond to m-SmG. 

H could be a temporal graphlet sequence or a graph. Then, 
property P is StG-reducible iff P(G[1,T])) = P(StG)), and 
P is SmG-reducible iffP(G[l, T])) = P(SmG)). 

We first consider StG-reducibility. We note that some prop- 
erties such as clique are not "well formed" for directed graphs. 
In such a case, we admit the use of the undirected version, that 
is, if P is evaluated on StG by simply ignoring the direction 
of the edges. We now consider a few properties. 

Observation 4.1. T- reachability is StG-reducible. This is be- 
cause the cross edges are tantamount to the "store " action. 

Observation 4.2. T-clique is not StG reducible. . 

Observation 4.3. T-k-connectivity is StG-reducible if and 
only if k = 1. 

Proof. That 1 -connectivity is StG-reducible follows by re- 
peated application of Observation 4.1 That 2-connectivity 



is not SG-reducible is illustrated by the "temporal triangle" 
which is defined as follows: V(l) = V(2) = V(3) = 
{a,b,c}, E(l) = (a, b), E(2) = (b,c), E(3) = (c,o). G[T] 
is 2-connected, but in Gs the cross edge between c(l) and 
c(2) is a bridge. It is easy to see that this is extensible to 3- 
connectivity in temporal K4 and so on. □ 

We now consider Smashed Graphs (SmG) and SmG- 
reducibility. Since SmG is lossy, it is clear that for the arbitrary 
case it is not reducible. However, there are two questions: 1) 
how close can we come? 2) are there special cases when it is 
reducible? The first question is the subject of later sections, 
here we state some simple results. 

Observation 4.4. T-clique is SmG-reducible. 

Observation 4.5. T-reachability is SmG-reducible if either of 
the following holds: (a) there is some G(t=T) that is identical 
to SmG; (b) there do not exist G(t) and G(t+1) such that the 
number of connected components increases. 



Consider observation |4.5| a) when the identical graphlet ei- 
ther occurs as the first or last in the sequence. 

Corollary 4.1. T-reachability is SmG-reducible if either (a) 
no edges are ever added; (b) no edges are ever deleted. 



Observation 4.5 allows arbitrary additions and deletions, but 
in a manner that preserves reachability. In practice, these con- 
ditions are easily checkable on a sequence of TGs and if they 
"pass", we can use the SmG as a way to get the value of graph 
theoretic properties such as reachability, clique, etc. 

The StG- and SmG-reducibility of numerous other graph 
theoretic concepts is interesting and open. 

4.2 Probabilistic Analysis of Smashing 

We analyze the properties of stacking and smashing on a ran- 
dom TGS constructed from a sequence of random Erdos-Renyi 
line graphs given by {ER(n,p, L n )}f =1 . 

The probability that a path of latency t time slots (under 
CuT metric) exists from node 1 to n is given by Eq. [5] Hence 
the probability that node 1 can reach node n within T = t 
graphlets is given by: 



t-i 



P(T <t) = J2 



r=0 



r - 2 



(i-p)V 



(14) 



If all the T graphlets are smashed into a single graph, then 
we can compute the probability of existence of a path from 
node 1 to n on the smashed graph SmG. (j, j + 1) € SmG iff 
3G, e {Gi, G 2 , . . . , G T } s.t. + 1) e G l . The probability 
of this happening is given by: 



Pr{(j,j + 1) e SmG} 



Pr{(j,j 

(i-pY 
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SmG} 



Therefore, the probability that a path exists within T 
graphlets is given by the following (since all n — 1 edge prob- 
abilities are independently distributed): 

P(T S mG <t) = (1-(1 -pfY 1 - 1 (15) 

If we decide to smash m graphlets at a time into one but 
preserve the rest of the stacked structure, then we have — 
graphlets instead of T. The probability of existence of an edge 
in any of these smashed graphlets is 1 — (1 — p) m . Hence 
the probability distribution of the existence of a path in an m- 
smashed TGS is given by: 



r>(rp(m) 



<*) 



T = 



r-2 



(16) 

Figure [7] illustrates the probability distributions of stacked, 
smashed, and m-smashed graphlets. We can observe that 
while smashed graphs yield only a crude upper bound on the 
real probabilities (i.e. stacked graphs), the procedure of m- 
smashing is useful since it can yield probability distributions 



Table 1: Examples of temporal graph properties 


T-* Property 


Definition 


T-adjacent(u, v) 


3(u,v)€G[l,T\ 


T-reachable(ii, v) 


3{(u, Vl ),...,(v k ,v)} £G[1,T] 


T-clique 


max{X\X C V(l),V ViV G X, T-adjacent(w, v)} 


T-fc-connected 


V5 = {vi, . . . , Vk-i} & Vs, if S is removed, 
V Ujt , T-reachable(w, v) 




5 ia is 

Number af icmpfif b! grsphtas 

Figure 8: Fraction of reachable node pairs in MC(n — 
20, p = 0.005, p = 0.5, q = 0.05) TGS. Squares correspond 
to SmG and circles to StG 



that are much better upper bounds especially for low values of 
m. For graphs where there exist multiple potential paths be- 
tween source and destination, this process is likely to be even 
more useful. 

While we have only shown the ER(n,p, L n ) scenario 
for the CuT scenario here, it is easy to extend it to the 

in 



MC(n,po,p,q, L n ) scenario, since one can apply Eq. 13 
this setting to compute the probability of existence of a path 
within T = t units of time. The probability of existence of an 
edge in a smashed graphlet in this model can is given by: 



P{T S mG < t) = (1 - 



P 



-(i-pr 1 ) 



t-l\n-l 



(17) 



We omit further details due to paucity of space. 

The effect of smashing was also investigated on another 
derivative metric, namely, the number (or fraction) of reach- 
able pairs over a given time budget T for G u — K n . We 
found by simulations that while the gap between SmG an 
StG was large for the ER(n,p) scenario (hence motivating 
m-smashing as shown earlier), for the MC(po, q,p) scenario, 
there were parameter values for which the gap was much lower 
(as exemplified by Fig. [8]). A thorough analysis of the (q,p) 
parameter space with respect to the reachability metric is a 
topic of future research. 



5 Adaptive Routing in Dynamic Ran- 
dom Graphs 

Traditional shortest paths problems attempt to find a path of 
minimum total distance from s to t and are solved by clas- 
sical algorithms that satisfy the suboptimal path property [6| 
(e.g., Dijkstra's and Bellman-Ford). Shortest paths, routing, 
and related problems have been considered in various stochas- 
tic models (when edges disappear permanently, when they do 
so periodically, etc.; see (T||2J[8][TT|[T5]-[T7)) but to the best 
of our knowledge, have not before been studied in the model 
considered here. Specifically, we consider the _Ei?'(n,p, G u ) 
model with SoA. In the adaptive generalization of the shortest 
paths concept to temporal graphs (sometimes called "next-hop 
routing"), the task is to choose, at each routing stage, the best 
neighbor to route to, if any, in order to minimize the remaining 
expected travel tim^] We solve the problem optimally, using a 
variant of Dijkstra's algorithm. Because in the adaptive setting 
we make a routing decision adaptively, each time we arrive at 
a node, based on its current set of outgoing edges, the algo- 
rithm performs its computation going from t to s, rather than 
from s to t. To motivate the algorithm, we make the following 
observations. 

Observation 5.1. In an unweighted graph, an optimal move 
from a neighbor v of t is to remain at v until the edge (v, t) 
appears, and then traverse it. 

Proof. Since traversing the edge takes one birth/death time 
step, the likelihood of being able to traverse edge (v, t) at the 
next time step is the same as that of being able to traverse 
(v' , t) for some mutual neighbor v'. □ 

Corollary 5.1. In a weighted graph, an optimal move from t's 
nearest neighbor v will be to remain at v until the edge (v, t) 
appears. 

Observation 5.2. In an optimal adaptive routing path, there 
will without loss of generality be no backtracking. 

Proof. Suppose in an optimal solution, we move from node v 
to u. Assume we move only when it gives us a strict improve- 
ment, i.e., METT[u] < METT[v]. In that case, once at u, 
we will never move to a node with expected remaining travel 
time greater than u's. □ 



In some cases, it may be best to remain at the current node for another 
birth/death time step, and then reevaluate. 
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Given this, the optimal deterministic routing algorithm sim- 
ply moves greedily in order to decrease the remaining mini- 
mum expected traversal time (METT): at time step, move 
from the current node v to a neighbor u e N(v) of minimum 
METT(u) from t, if there is one such that METT{u) < 
METT(v); otherwise, remain at node v until the next time 
step. This algorithm assumes an oracle to compute METT(v) 
for each node v, which is done by Algorithm[T] 

Algorithm 1 Computing minimum expected traversal times 

l: for each v do 

2: METT[v] = oo 

3: end for 

4: METT[t] = 

5; Q «- V 

6: while Q ^ do 

7: u <— extract-min(Q) 

8: for each v € N(u) n Q do 

9: d v <- /(p, {M£TT[«] : 6 e AT(«) - Q}) 

10: if d v < METT[v] then 

11: METT[v] = d v 

12: end if 

13: end for 

14: end while 



Let N(v) indicate the neighbors of v, and f(p, S) indicate 
the function computing the METT from a node v to t, along 
a paf/z w/jose next-hop node is a member of S. Given the ex- 
pected traversal times of the nodes in S, / can easily be com- 
puted in time 0(\N(v)\ log |JV(u)|): sort the neighbor nodes 
v in order of METT[v}. The set of neighbor nodes chosen 
to consider as next-hop candidates in the event that we arrive 
at node u is the prefix of the v sequence that minimizes the 
expected remaining traversal time from v to t. 

Lemma 5.1. Restricted to an available set of nodes S to use as 
next hop, and based on the correct METT values of the mem- 
bers of S, the function f(p, S) correctly computes M ETT[u\. 

Proof. We need a policy that tells us, when offered a set of 
the choices, which we should accept, if any, or whether we 
should instead wait a timestep and try again. Since the graph 
is Erdos-Renyi, a memoryless policy suffices, i.e., we make 
the decision based only on the set of available choices, inde- 
pendent of how long we have spent at the current node. Given 
this, if the cheapest-cost edge is available right now, clearly 
it should be chosen. If an optimal policy says to take the kth 
cheapest edge (among all the potential choices), if it happens 
to be the best available, then it follows that we should also 
take the jth cheapest edge, for 1 < j < k, if it happens to 
be available. Therefore the only thing to determine then is the 
best value k, i.e., the one leading to the policy that minimizes 
M ETT from this node. □ 

Theorem 5.1. Algorithm [7] correctly computes the METT 
values for the SoA model. 



Proof. We prove by induction on the nodes removed from Q. 
The expected traversal time of for t is correct by definition. 
Moreover, by the proof of Corollary |5.1| the expected traversal 
time of t's nearest neighbor it is 1/p + 1, 

Suppose there is at least one node whose computed M ETT 
value is incorrect, i.e., larger than optimaQ Among such 
nodes, let u be one whose true METT value is minimum. 
Note that if Vi is removed before Vj, then METT\vi\ < 
METT[vj}. This follows from the fact that we remove nodes 
by performing extract-min operations, and that the function 
f(p,-) is non-decreasing. There must be at least one path from 
u to t, i.e., u must have at least one neighbor whose true opti- 
mal expected time to t is strictly less than u's. In fact, u may 
have several such neighbors. Call them vi, ...,vg. If the true 
METT values of Ui , Vi are all smaller than u's. then by 
the induction assumption their METT values are correct, and 
hence so is u's. 

Now suppose some such m has not yet been removed. This 
implies that its computed METT value will be at least u's, 
even though t'i's true M ETT value is smaller than u's, which 
contradicts the induction hypothesis. □ 

We can now redefine the CuT model as the one in which 
all edge weights are 0. The effect of this is that v neighbor 
set N(v) is replaced in the routing algorithm with the set of 
all nodes reachable from v, since upon arrival at u, we can 
consider cutting through instantly to any node that 1) would 
be an improvement over u and 2) to which there currently is 
an accessible path from u. 

Corollary 5.2. Modified appropriately, Algorithmic or re ctly 
computes the optimal traversal times for the CuT model, as 
well as for the nonnegative integer-weighted model subsuming 
SoA and CuT. 

Proof. An oracle to compute the probability that there exists 
an edge between v and u, for all pairs (v, u) can be computed 
in polynomial time by dynamic programming. We omit the 
details due to lack of space. □ 

6 Discussion and Future Work 

This paper marks the first step toward a research program 
aimed at developing a theory of temporal graphs from both 
stochastic and deterministic (or classical) points of view. We 
plan to develop the research program in multiple directions. 
First, the probability distribution results in Sec. [3] need to be 
extended beyond simple scenarios such as dynamic random 
path, especially to scenarios where there are multiple possi- 
ble (intermittent) paths between the source and the destina- 
tion. Second, in addition to the T-* properties discussed in 

7 Note that the M ETT computation always corresponds to a collection of 
paths from u to t; a traversal strategy from u to t restricted to such a path 
collection can only have expected cost greater or equal to the optimal. 
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Sec. [4] other properties such as chromatic number, indepen- 
dent set, and dominating set are worth investigating. One in- 
teresting question is whether m-smashing can be improved by 
a non-uniform choice of m. If the deterministic sequence of 
graphs is known, then this is akin to a compression problem 
where more graphlets will be smashed around times when the 
temporal ordering does not matter much, and less graphlets 
will be smashed around other times, thus preserving the tem- 
poral structure. However, for a given dynamic random graph 
model, it may be interesting to develop rules of thumb for non- 
uniform smashing. In addition to graphlet union (or SmG), 
graphlet intersection can be interesting since it can be used to 
quantify redundancy in spatio-temporal paths in a TGS. 
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